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AMENDMENTS TO THE CLAIMS 

This listing of claims will replace all prior versions, and listings, of claims in the 
application: 

1. (Canceled) 

2. (Currently amended) A method according to Claim 45, wherein the 
step of determining the third value comprises: 

determining, for each of the first plurality of data fields and respective ones of the 
second plurality of data fields, a fourth value based on a mean of a first value 
determined for one of the first plurality of data fields and a second value determined for 
a respective one of the second plurality of data fields; and 

summing the determined fourth values. 

3. (Currently amended) A method for determining a value representing 
a difference between a first record comprising a first plurality of data fields and a second 
record comprising a second plurality of data fields, each of the first plurality of data 
fields corresponding to a respective one of the second plurality of data fields, the 
method comprising: 

for each of the first plurality of data fields, determining a first value representing a 
difference be tween data specified in the data field and data specified in a respective 
one of the second Plurality of data fields: 

for each of the second plurality of data fields, determining a second value 
representing a difference between data specified in the data field and data specified in a 
respective one of the first plurality of data fields: 

determining a third value representing a difference between the first record and 
the second record based on the determined first and second values: and 
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identifying whether the first and second records are duplicates based on the 
determined third value, wherein the determining and Identifying is provided bv a 
processor. 

A mothod aooording to Claim 4 , wherein the stee-ef-determining of the third value 
comprises: 

determining a sum of the determined first values and the determined second 
values; and 

dividing the sum by two. 

4. (Currently amended) A method according to Claim 46, wherein the 
step-ef-determining pfthe first value and the step-ef-determining of the second value 
comprise identical ooerationss teBS performed with respect to different inputs. 

5. (Currently amended) A method for determining a value representing 
a difference between a first record comprising a first plurality of data fields and a second 
record comprising a second plurality of data fields, each of the first plurality of data 
fields corresponding to a respective one of the second plurality of data fields, the 
method comprising: 

for each of the first plurality of data fields, determining a first value representing a 
difference between data specified in the data field and data specified in a respective 
one of the second plurality of data fields; 

for each of the sec»nd,plurality_of ,data fields, determining a second value 
representing a difference between data specified in the data field and data specified in a 
respective one of the first plurality of data fields; 

determining a third value representing a difference between the first record and 
the second record based on the determined first and second values; and 

Identifying whether the first and second records are duplicates based on the 
determined third value, wherein the determining and identifying is provided bv a 
processor. 
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A method accord i ng to Claim 1, w herein the etep-ef-determining pfthe first value 
comprises: 

determining an asymmetric spelling distance as a normalized cost for converting 
first input data to second input data via a sequence of operations; and 
wherein the step of determining the second value comprises: 
determining an asymmetric spelling distance as a normalized cost for converting 
second input data to first input data via the sequence of operations 

6. (Currently amended) A method according to Claim 5, wherein, in the 
etep-ef-deterrnining ofjhe first value, the first input data is data specified in one of the 
first plurality of data fields and the second input data is data specified in a respective 
one ofthe second plurality of data fields, and 

wherein, in the step-ef-determining ofthe second value, the first input data is data 
specified in one ofthe second plurality of data fields and the second input data is data 
specified in a respective one of the first plurality of data fields. 

7. (Currently amended) A method according to Claim 45, further 
comprising: 

converting numerical data specified in the one or more ofthe first plurality of data 
fields and the second plurality of data fields to text data. 

8. (Currently amended) A method according to Claim 45, wherein the 
first plurality of data fields and the second plurality of data fields Include only those fields 
ofthe first record and the second record that specify data that is not identical to data 
specified in a respective field. 

9 - 32. (Canceled) 

33. (Currently amended) An apparatu s m e d i um according to Claim 342, 

wherein the instrucfaonss tee to determine the first value and the instructions s tep-to 
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determine the second value comprise identical steps performed with respect to different 
inputs. 

34. (Currently amended) An apparatus m edium accord i ng to Cl a im 

32gtorjng processor-executable instructions thereon to determine a value representing a 
difference between a first record comprising a first plurality of data fields and a second 
record comprising a second plurality of data fields, each of the first plurality of data 
fields corresponding to a respective one of the second plurality of data fields, the 
instructions comprising: 

instructions to determine, for each of the first plurality of data fields, a first value 
representing a difference between data specified in the date field and data specified in a 
respective one of the second plurality of data fields; 

instructions to determine, for each of the second plurality of data fields, a second 
value representing a difference between data specified in the data field and data 
specified in a respective one of the first plurality of data fields: 

instructions to determine a third value representing a difference between the first 
record and the second record based on the determined first and second va lues' and 

instructions to identify whether the first and second records are duplicates based 
OH the determined third value, 

wherein the instructions s tep-to determine the first value comprises: 

instructions a -stee-to determine an asymmetric spelling distance as a normalized 
cost for converting first input data to second input data via a sequence of operations; 
and 

wherein the instructions step-to determine the second value comprises: 
instructions a -stee-to determine an asymmetric spelling distance as a normalized 
cost for converting second input data to first input data via the sequence of operations. 

35. (Currently amended) An apparatus m odium a ccording to Claim 34, 

wherein, in the instructions e tee-to determine the first value, the first input data is data 
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specified in one of the first plurality of data fields and the second input data is data 
specified in a respective one of the second plurality of data fields, and 

wherein, in the instructions stea-to determine the second value, the first input 
data is data specified in one of the second plurality of data fields and the second input 
data is data specified in a respective one of the first plurality of data fields. 

36. (Currently amended) An apparatus m e dium a ccording to Claim 342, 

wherein the first plurality of data fields and the second plurality of data fields include 
only those fields of the first record and the second record that specify data that is not 
identical to data specified in a respective field. 

37 -38. (Canceled) 

39. (Currently amended) A data warehouse according to Claim 3840, 
wherein the instructions adapted to be executed by the processor to determine the first 
value and to determine the second value comprise identical steps performed with 
respect to different inputs. 

40. (Currently amended) A data warehouse according to C la im 38 x 
comprising: 

a processor; and 

a storage device in communication with the processor and storing instructions 
adapted to be executed bv the processor to: 

determine, for each of a first plurality of data fields of a first record, a first 
value representing a difference between data specified in toe data field and data 
specified In a respective one of a second plurality of data fields of a second 
record. 

determine, for each of the second plurality of data fields, a second value 
representing a difference between data specified in the data field and data 
specified in a respective one of the first plurality of data fields. 
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determine a third value representing a difference between the first record 
and the second record based on the determined first and second values; and 

identify whether the fir st and second records are duplicates based on the 
determined third value., wherein the instructions adapted to be executed by the 
processor to determine the first value comprise instructions adapted to be executed by 
the processor to: 

determine an asymmetric spelling distance as a normalized cost for converting 
first input data to second input data via a sequence of operations; and 

wherein the instructions adapted to be executed by the processor to determine 
the second value comprise instructions adapted to be executed by the processor to: 
determine an asymmetric spelling distance as a normalized cost for converting the 
second input data to the first input date via the sequence of operations. 
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